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Abstract 

Long non-coding RNAs (IncRNAs) are emerging as new players in the cancer paradigm demonstrating potential 
roles in both oncogenic and tumor suppressive pathways. These novel genes are frequently aberrantly expressed 
in a variety of human cancers, however the biological functions of the vast majority remain unknown. Recently, 
evidence has begun to accumulate describing the molecular mechanisms by which these RNA species function, 
providing insight into the functional roles they may play in tumorigenesis. In this review, we highlight the 
emerging functional role of IncRNAs in human cancer. 



Introduction 

One of modern biology's great surprises was the discovery 
that the human genome encodes only -20,000 protein- 
coding genes, representing <2% of the total genome 
sequence [1,2]. However, with the advent of tiling resolu- 
tion genomic microarrays and whole genome and tran- 
scriptome sequencing technologies it was determined that 
at least 90% of the genome is actively transcribed [3,4]. 
The human transcriptome was found to be more complex 
than a collection of protein-coding genes and their splice 
variants; showing extensive antisense, overlapping and 
non-coding RNA (ncRNA) expression [5-10]. Although 
initially argued to be spurious transcriptional noise, recent 
evidence suggests that the proverbial "dark matter" of the 
genome may play a major biological role in cellular devel- 
opment and metabolism [11-17]. One such player, the 
newly discovered long non-coding RNA (IncRNA) genes, 
demonstrate developmental and tissue specific expression 
patterns, and aberrant regulation in a variety of diseases, 
including cancer [18-27]. 

NcRNAs are loosely grouped into two major classes 
based on transcript size; small ncRNAs and IncRNAs 
(Table 1) [28-30]. Small ncRNAs are represented by a 
broad range of known and newly discovered RNA species, 
with many being associated with 5' or 3' regions of genes 
[4,31,32]. This class includes the well-documented miR- 
NAs, RNAs -22 nucleotides (nt) long involved in the spe- 
cific regulation of both protein-coding, and putatively 
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non-coding genes, by post-transcriptional silencing or 
infrequently by activation [33-35]. miRNAs serve as major 
regulators of gene expression and as intricate components 
of the cellular gene expression network [33-38]. Another 
newly described subclass are the transcription initiation 
RNAs (tiRNAs), which are the smallest functional RNAs 
at only 18 nt in length [39,40]. While a number of small 
ncRNAs classes, including miRNAs, have established roles 
in tumorigenesis, an intriguing association between the 
aberrant expression of ncRNA satellite repeats and cancer 
has been recently demonstrated [41-46]. 

In contrast to miRNAs, IncRNAs, the focus of this arti- 
cle, are mRNA-like transcripts ranging in length from 
200 nt to -100 kilobases (kb) lacking significant open 
reading frames. Many identified IncRNAs are transcribed 
by RNA polymerase II (RNA pol II) and are polyadeny- 
lated, but this is not a fast rule [47,48]. There are exam- 
ples of IncRNAs, such as the antisense asOct4-pgS or the 
brain-associated BC200, which are functional, but not 
polyadenylated [49-51]. Generally, IncRNA expression 
levels appear to be lower than protein-coding genes 
[52-55], and some IncRNAs are preferentially expressed 
in specific tissues [21]. However, recent findings have 
suggested novel IncRNAs may contribute a significant 
portion of the aforementioned 'dark matter' of the 
human transcriptome [56,57]. In an exciting report by 
Kapranov etal, it was revealed the bulk of the relative 
mass of RNA in a human cell, exclusive of the ribosomal 
and mitochondrial RNA, is represented by non-coding 
transcripts with no known function [57]. 
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Table 1 Types of human non-coding RNAs 



Type 


Subclasses 


Symbol 


References 




Transfer RNAs 


tRNAs 


[222] 




MicroRNAs 


miRNAs 


[33,34] 




Ribosomal 5S and 5.8S RNAs 


rRNAs 


[223,224] 




Piwi interacting RNAs 


piRNAs 


[38,225] 




Tiny transcription initiation RNAs 


tiRNAs 


[39,40] 




Small interfering RNAs 


siRNA 


[32] 




Promoter-associated short RNAs 


PASRs 


[144J49] 




Termini-associated short RNAs 


TASRs 


[144 149] 




Antisense termini associated short RNAs 


aTASRs 


[226] 


Small ncRNA (18 to 200 nt in size) 


Small nucleolar RNAs 


snoRNAs 


[227,228] 




Transcription start site antisense RNAs 


TSSa-RNAs 


[229] 




Small nuclear RNAs 


snRNAs 


[230] 




Retrotransposon-derived RNAs 


RE-RNAs 


[231,232] 




3'UTR-derived RNAs 


uaRNAs 


[145] 




x-ncRNA 


x-ncRNA 


[233] 




Human Y RNA 


hY RNA 


[234] 




Unusually small RNAs 


usRNAs 


[235] 




Small NF90-associated RNAs 


snaRs 


[236,237] 




Vault RNAs 


vtRNAs 


[238] 




Ribosomal 18S and 28S RNAs 


rRNAs 


[223,224] 




Long or large intergenic ncRNAs 


lincRNAs 


[8,58] 




Transcribed ultraconserved regions 


T-UCRs 


[85] 




Pseudogenes 


none 


[239,240] 




GAA-repeat containing RNAs 


GRC-RNAs 


[241] 


Long ncRNA (IncRNAs, 200 nt to > 100 kb in size) 


Long intronic ncRNAs 


none 


[242,243] 




Antisense RNAs 


aRNAs 


[244] 




Promoter-associated long RNAs 


PALRs 


[144,149] 




Promoter upstream transcripts 


PROMPTS 


[245] 




Stable excised intron RNAs 


none 


[56] 




Long stress-induced non-coding transcripts 


LSINCTs 


[25] 



Like miRNAs and protein-coding genes, some tran- 
scriptionally active IncRNA genes display histone H3K4 
trimethylation at their 5'-end and histone H3K36 tri- 
methylation in the body of the gene [8,58,59]. The small 
number of characterized human IncRNAs have been 
associated with a spectrum of biological processes, for 
example, epigenetics, alternative splicing, nuclear 
import, as structural components, as precursors to small 
RNAs and even as regulators of mRNA decay [4,60-70]. 
Furthermore, accumulating reports of misregulated 
IncRNA expression across numerous cancer types sug- 
gest that aberrant IncRNA expression may be a major 
contributor to tumorigenesis [71]. This surge in publica- 
tions reflects the increasing attention to this subject 
(Figure 1) and a number of useful IncRNA databases 
have been created (Table 2). In this review we highlight 
the emerging functional role of aberrant IncRNA expres- 
sion, including transcribed ultraconserved regions 
(T-UCRs), within human carcinomas. 
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Year of Publication 
Figure 1 Publications describing cancer-associated ncRNAs 

Entries are based on a National Library of Medicine Pubmed search 
using the terms "ncRNA" or "non-coding RNA" or "noncoding RNA" 
or non-protein-coding RNA" with cancer and annual (Jan.1-Dec.31) 
date limitations. 
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Table 2 Publically available long non-coding RNA online databases 


Database Name 


Website 


Reference 


ncRNAimprint 


http://rnaqueen.sysu.edu.cn/ncRNAimprint/ 


[246] 


ncRNAdb 


http://research.imb.uq.edu.au/rnadb/ 


[247] 


Functional RNAdb 


http://www.ncrna.org/frnadb/ 


[248] 


NONCODE 


http://www.noncode.org/ 


[249] 


IncRNA db 


http://longnoncodingrna.com/ 


[250] 


Rfam 


http://rfam.sanger.ac.uk/ 


[251] 


NRED 


http://jsm-research.imb.uq.edu.au/nred/cgi-bin/ncrnadb.pl 


[252] 


Ncode (Invitrogen) 


http://escience.invitrogen.com/ncRNA/ 


N/A 


NcRNA Database 


http://biobases.ibch.poznan.pl/ncRNA/ 


[253] 


T-UCRs 


http://users.soe.ucsc.edu/~jill/ultra.html 


[202] 


NPInter 


http://www.bioi nfo.org .cn/N PI nter/ 


[254] 



Background on Long Non-Coding RNA 

Nomenclature and Classification of LncRNA 

The definition 'non-coding RNA' is typically used to 
describe transcripts where sequence analysis has failed 
to identify an open reading frame. However, one should 
exercise caution when exploring putative non-coding 
transcripts, as there are cases where 'non-coding' tran- 
scripts were found to encode short, functional peptides 
[72]. Currently, a universal classification scheme to 
define IncRNAs does not exist and therefore there are 
numerous synonyms describing the same type of tran- 
scripts. Terms such as large non-coding RNA, mRNA- 
like long RNA, and intergenic RNA all define cellular 
RNAs, exclusive of rRNAs, greater than 200 nt in length 
and having no obvious protein-coding capacity [62]. 
Consequently, this has led to confusion in the literature 
as to exactly which transcripts should constitute a 
IncRNA. For example, one subclass of IncRNAs is called 
large or long intergenic ncRNAs (lincRNAs). These 
IncRNAs are exclusively intergenic and are marked by a 
chromatin signature indicative of transcription [8,58]. 
To further compound classification confusion, RNA spe- 
cies that are bifunctional preclude categorization into 
either group of protein-coding or ncRNAs as their tran- 
scripts function both at the RNA and protein levels [73]. 
In these rare cases, they are classified on a case-by-case 
basis, reserving the term 'IncRNA' to describe tran- 
scripts with no protein-coding capacity. In the mean- 
time, and for the purposes of this review, we will 
consider IncRNAs as a blanket term to encompass 
mRNA-like ncRNAs, lincRNAs, as well as antisense and 
intron-encoded transcripts, T-UCRs and transcribed 
pseudogenes. 

Discovery of LncRNAs 

The earliest reports describing IncRNA predated the dis- 
covery of miRNAs, although the term IncRNA' had not 
been coined at the time (Figure 2). One of the first 
IncRNA genes reported was the imprinted HI 9 gene, 



which was quickly followed by the discovery of the 
silencing X-inactive-specific transcript (XIST) IncRNA 
gene, which plays a critical function in X-chromosome 
inactivation [74,75]. However, the discovery of the first 
miRNA lin-14 dramatically redirected the focus of 
ncRNA research from long ncRNAs to miRNAs [76]. 
Moreover, during this time the discovery of miRNAs 
revealed RNA could regulate gene expression and later 
that entire gene networks could be affected by ncRNA 
expression and within the last decade miRNAs were dis- 
covered to be associated with cancer (Figure 2) [76-79]. 
At the time of this writing there are approximately 1049 
human miRNAs described in miRBase V16 [80,81] with 
the potential of affecting the expression of approxi- 
mately 60% of protein -coding genes [82,83]. Conversely, 
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Figure 2 Timeline of cancer-associated ncRNA discoveries 
relative to transcriptome analysis technologies (not drawn to 
scale). 
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the variety and dynamics of IncRNA expression was not 
to be fully appreciated until the introduction of whole 
transcriptome sequencing. With the advent of the FAN- 
TOM and ENCODE transcript mapping projects, it was 
revealed that the mammalian genome is extensively 
transcribed, although a large portion of this represented 
non-coding sequences [3,84]. Coupled with the novel 
functional annotation of a few IncRNAs, this discovery 
promoted research focusing on IncRNA discovery and 
characterization. Recent reports have described new 
IncRNA classes such as lincRNAs and T-UCRs [8,58,85]. 
Current estimates of the IncRNA gene content in the 
human genome ranges from -7000 - 23,000 unique 
IncRNAs, implying this class of ncRNA will represent an 
enormous, yet undiscovered, component of normal cel- 
lular networks that may be disrupted in cancer biology 
[62]. 

Emerging Role of Long Non-Coding RNA in 
Tumorigenesis 

A role for differential IncRNA expression in cancer had 
been suspected for many years, however, lacked strong 
supporting evidence [86]. With advancements in cancer 



transcriptome profiling and accumulating evidence sup- 
porting IncRNA function, a number of differentially 
expressed IncRNAs have been associated with cancer 
(Table 3). LncRNAs have been implicated to regulate a 
range of biological functions and the disruption of some of 
these functions, such as genomic imprinting and transcrip- 
tional regulation, plays a critical role in cancer develop- 
ment. Here we describe some of the better characterized 
IncRNAs that have been associated with cancer biology. 
Imprinted IncRNA genes 

Imprinting is a process whereby the copy of a gene inher- 
ited from one parent is epigenetically silenced [87,88]. 
Intriguingly, imprinted regions often include multiple 
maternal and paternally expressed genes with a high fre- 
quency of ncRNA genes. The imprinted ncRNA genes 
are implicated in the imprinting of the region by a variety 
of mechanisms including enhancer competition and 
chromatin remodeling [89]. A key feature of cancer is the 
loss of this imprinting resulting in altered gene expres- 
sion [90,91]. Two of the best known imprinted genes are 
in fact IncRNAs. 

HI 9 The HI 9 gene encodes a 2.3 kb IncRNA that is 
expressed exclusively from the maternal allele. H19 and 



Table 3 Human cancer-associated IncRNAs 



LncRNA 


Size 


Cytoband 


Cancer types 


References 


HOTAIR 


2158 nt 


12q13.13 


Breast 


[18,68] 


MALATl/a/NEAT2 


7.5 kb 


11q13.1 


Breast, lung, uterus, pancreas, colon, prostate, liver, cervix 1 , 
neuroblastoma 1 , osteosarcoma 


[135,137-139,152,255,256] 


HULC 


500 nt 


6p24.3 


Liver, hepatic colorectal metastasis 


[170,171] 


BC200 


200 nt 


2p21 


Breast, cervix, esophagus, lung, ovary, parotid, tongue 


[50,51] 


H19 


2.3 kb 


1 1 pi 5.5 


Bladder, lung, liver, breast, endometrial, cervix, esophagus, ovary, 
prostate, choricarcinoma, colorectal 


[74,92,95,97,102,103,257-264] 


BIC/MIRHG155/ 
MIRHG2 


1.6 kb 


21q1 1.2 


B-cell lymphoma 


[153] 


PR NCR! 


13 kb 


8q24.2 


Prostate 


[187] 


LOC285194 


2105 nt 


3q13.31 


Osteosarcoma 


[265] 


PCGEM1 


1643 nt 


2q32.2 


Prostate 


[188,266,267] 


UCA1/CUDR 


1 .4 kb, 2.2 kb, 2.7 
kb 


1 9p1 3.1 2 


Bladder, colon, cervix, lung, thyroid, liver, breast, esophagus, stomach 


[268-270] 


DD3/PCA3 


0.6 kb, 2 kb, 4 kb 


9q21.22 


Prostate 


[189,190] 


anti-N0S2A 


-1.9 kb 


17q23.2 


Brain 1 


[271] 


uc.73A 


201 nt 


2q22.3 


Colon 


[200] 


TUC338 (encodes 
uc.338) 


590 nt 


12q13.13 


Liver 


[203] 


ANRIL/pl5AS/ 
CDK2BAS 


34.8 kb & splice 
variants 


9p21.3 


Prostate, leukemia 


[175,176,183,272] 


MEG3 


1.6 kb & splicing 
isoforms 


14q32.2 


Brain (downregulated) 


[156-158,162] 


GAS5/SNHG2 


Multiple isoforms 


1q25.1 


Breast (downregulated) 


[273] 


SRA-1/SRA 
(bifunctional) 


1965 nt 


5q31.3 


Breast, uterus, ovary (hormone responsive tissue) 


[274,275] 


PTENP1 


-3.9 kb 


9p13.3 


Prostate 


[173,174] 


ncRAN 


2186 nt, 2087 nt 


17q25.1 


Bladder, neuroblastoma 


[276,277] 



1 Cell lines. 
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its reciprocally imprinted protein-coding neighbor the 
Insulin-Like Growth Factor 2 or IGF2 gene at lip 15. 5 
were among the first genes, non-coding or otherwise, 
found to demonstrate genomic imprinting [74,92]. The 
expression of HI 9 is high during vertebrate embryo devel- 
opment, but is downregulated in most tissues shortly after 
birth with the exception of skeletal tissue and cartilage 
[20,93,94]. Loss of imprinting and subsequent strong gene 
expression has been well-documented in human cancers. 
Likewise, loss of imprinting at the HI 9 locus resulted in 
high HI 9 expression in cancers of the esophagus, colon, 
liver, bladder and with hepatic metastases [95-97]. 

HI 9 has been implicated as having both oncogenic and 
tumor suppression properties in cancer. HI 9 is upregu- 
lated in a number of human cancers, including hepato- 
cellular, bladder and breast carcinomas, suggesting an 
oncogenic function for this IncRNA [97-99]. In colon 
cancer HI 9 was shown to be directly activated by the 
oncogenic transcription factor c-Myc, suggesting HI 9 
may be an intermediate functionary between c-Myc and 
downstream gene expression [98] . Conversely, the tumor 
suppressor gene and transcriptional activator p53 has 
been shown to down-regulate H19 expression [100,101]. 
HI 9 transcripts also serve as a precursor for miR-675, a 
miRNA involved in the regulation of developmental 
genes [102]. miR-675 is processed from the first exon of 
HI 9 and functionally downregulates the tumor suppres- 
sor gene retinoblastoma (RBI) in human colorectal can- 
cer, further implying an oncogenic role for H19 [103]. 

There is evidence suggesting HI 9 may also play a role 
in tumor suppression [104,105]. Using a mouse model 
for colorectal cancer, it was shown that mice lacking HI 9 
manifested an increased polyp count compared to wild- 
type [106]. Secondly, a mouse teratocarcinoma model 
demonstrated larger tumor growth when the embryo 
lacked HI 9, and finally in a hepatocarcinoma model, 
mice developed cancer much earlier when HI 9 was 
absent [107]. The discrepancy as to whether H19 has 
oncogenic or tumor suppressive potential may be due in 
part to the bifunctional nature of the IncRNA or may be 
context dependent. In either case, the precise functional 
and biological role of HI 9 remains to be determined. 
XIST - X-inactive-specific transcript The 17 kb IncRNA 
XIST is arguably an archetype for the study of functional 
IncRNAs in mammalian cells, having been studied for 
nearly two decades. In female cells, the XIST transcript 
plays a critical role in X-chromosome inactivation by phy- 
sically coating one of the two X-chromosomes, and is 
necessary for the cis-inactivation of the over one thou- 
sand X-linked genes [75,108-110]. Like the IncRNAs 
HOTAIR and ANRIL, XIST associates with polycomb- 
repressor proteins, suggesting a common pathway of 
inducing silencing utilized by diverse IncRNAs. In mice, 
X inactivation in the extraembryonic tissues is non- 



random, and the initial expression of Xist is always pater- 
nal in origin, followed later by random X inactivation in 
the epiblast associated with random mono-allelic expres- 
sion [111]. Intriguingly, regulation of Xist in mouse has 
been shown to be controlled by interactions amongst 
additional ncRNAs, including an antisense transcript, 
Tsix, enhancer-associated ncRNAs (Xite) and the 
upstream Jpx and Ftx IncRNAs [109,112-115]. Intrigu- 
ingly, murine Xist and Tsix duplexes are processed into 
small RNAs by an apparently Dicer-dependent manner, 
suggesting a significant overlap between the regulatory 
networks of both IncRNAs and sRNAs [116]. It is unclear, 
however, how much of this regulation is conserved in 
humans, who do not show imprinted X inactivation 
[117]. While XIST expression levels are correlated with 
outcome in some cancers, such as the therapeutic 
response in ovarian cancer [118], the actual role that 
XIST may play in human carcinomas, if any, is not 
entirely clear. 

There is generally believed to be only a limited develop- 
mental window in which X inactivation can occur, and 
loss of XIST from an inactive X chromosome does not 
result in reactivation of the X chromosome [119,120]. 
Thus, tumors with additional X chromosomes generally 
keep the inactivation status of the duplicated X. For 
tumors in which two active X chromosomes are observed, 
as has been frequently observed in breast cancer, the most 
common mechanism involves loss of the inactive X and 
duplication of the active X, often resulting in heteroge- 
neous XIST expression in these tumors [121-123]. Loss of 
the inactive X and abnormalities of XIST expression may 
be indicative of more general heterochromatin defects 
[124]. Analogously, failure to properly reset XIST and X 
inactivation may serve as a marker of proper resetting of 
epigenetic marks in stem cells or induced pluripotent 
stem cells [125,126]. Correlations between XIST expres- 
sion and cancer state can also occur spuriously. Notably, 
in cytogenetically normal cells, XIST is found only in 
females as males do not have an inactive X. Therefore in 
cancers correlations can be observed with XIST expression 
due to differential presence of male or female samples, 
and many cancers do show different onsets and progres- 
sions in males and females. Furthermore, XIST expression 
will increase with the number of inactive X chromosomes. 
While it might be anticipated that there would be little 
advantage to a tumor acquiring inactive chromosomes, it 
has been shown that approximately 15% of human X- 
linked genes continue to be expressed from the inactive X 
chromosome [127]. 
Involvement In metastasis 

HOTAIR - HOX antisense intergenic RNA Several 
IncRNAs have been implicated in metastasis. One of the 
first IncRNAs described to have a fundamental role in 
cancer was the metastasis-associated HOX Antisense 
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Intergenic RNA (HOTAIR), a 2.2 kb gene located in the 
mammalian HOXC locus on chromosome 12ql3.13 
[18]. This IncRNA was found to be highly upregulated 
in both primary and metastatic breast tumors, demon- 
strating up to 2000-fold increased transcription over 
normal breast tissue [68]. High levels of HOTAIR 
expression were found to be correlated with both metas- 
tasis and poor survival rate, linking a ncRNA with can- 
cer invasiveness and patient prognosis [68]. 
Furthermore, it was demonstrated that if cells expres- 
sing HOTAIR were grafted into mouse mammary fat 
pads, a modest increase in the rate of primary tumor 
growth was observed [68]. Interestingly, there are 
reports indicating that numerous IncRNAs are tran- 
scribed from the HOX locus, suggesting that HOTAIR 
may be only one example of a global regulatory phe- 
nomena [58]. 

The spliced and polyadenylated HOTAIR RNA does not 
encode any proteins but has been demonstrated to be inti- 
mately associated with the mammalian polycomb repres- 
sive complex 2 (PRC2) which is comprised of the H3K27 
methylase EZH2, SUZ12 and EED [68,128,129] (Figure 3). 
Polycomb group proteins mediate repression of transcrip- 
tion of thousands of genes controlling differentiation path- 
ways during development, and have roles in stem cell 
pluripotency and human cancer [68,130-133]. 

HOTAIR binding results in a genome- wide re- targeting 
of the PRC2 complex. The HOXD locus on chromosome 




HOXC Locus 
Chromosome 12 



HOXD Locus 
Chromosome 2 

Metastasis suppressor genes 

Figure 3 Proposed mechanism of HOTAIR mediated gene 
silencing of 40 kb of the HOXD locus, which is involved in 
developmental patterning. The HOTAIR IncRNA is transcribed from 
the HOXC locus and functions in the binding and recruitment and 
binding of the PRC2 and LSD1 complex to the HOXD locus. For 
clarity, only the PRC2 complex is indicated in the above figure. 
Through an undetermined mechanism, the HOTA //?-PRC2-LSD1 
complex is redirected to the HOXD locus on chromosome 2 where 
genes involved in metastasis suppression are silenced through 
H3K27 methylation and H3K4 demethylation. This drives breast 
cancer cells to develop gene expression patterns that more closely 
resemble embryonic fibroblasts than epithelial cells. 



2 is a PRC2 target (Figure 3). The consequence of PRC2/ 
HOTAIR localization is the transcriptional silencing of a 
40 kb region of the HOXD locus which remodels the 
gene expression pattern of breast epithelial cells to more 
closely resemble that of embryonic fibroblasts [68]. The 
HOTAIR RNA appears to act as a molecular scaffold, 
binding at least two distinct histone modification com- 
plexes. The 5' region of the RNA binds the PRC2 com- 
plex responsible for H3K27 methylation, while the 3' 
region of HOTAIR binds LSD1, a histone lysine demethy- 
lase that mediates enzymatic demethylation of H3K4Me2 
[128,134]. Although, the precise mechanism of HOTAIR 
activities remains to be elucidated, it is clear that 
HOTAIR reprograms chromatin state to promote cancer 
metastasis. 

MALAT1 - Metastasis-associated lung adenocarcinoma 
transcript 1 The MALAT1 gene, or metastasis-associated 
lung adenocarcinoma transcript I, was first associated 
with high metastatic potential and poor patient prognosis 
during a comparative screen of non-small cell lung cancer 
patients with and without metastatic tumors [135]. This 
IncRNA is widely expressed in normal human tissues 
[135,136] and is found to be upregulated in a variety of 
human cancers of the breast, prostate, colon, liver and 
uterus [27,137-139]. Notably, the MALA Tl locus at 
llql3.1 has been identified to harbor chromosomal trans- 
location breakpoints associated with cancer [140-142]. 

Intriguingly, cellular MALA Tl transcripts are subject to 
post-transcriptional processing to yield a short, tRNA- 
like molecule mascRNA and a long MALA Tl transcript 
with a poly(A) tail-like moiety [143] (Figure 4). Using the 
mouse homologue to the human MALAT1 gene, it was 




Figure 4 Expression and processing of MALAT1 transcripts. Full 

length 7.5 kb MALAT1 RNA is processed by RNaseP and RNaseZ to 
generate the small ncRNA mascRNA, which is then exported to the 
cytoplasm. The larger MALAT1 RNA is retained in the nuclear 
speckles where it is thought to have a role in regulating alternative 
splicing machinery. 
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revealed that ribonuclease (RNase) P processing gener- 
ates the 3' end of the long MALA Tl transcript and the 5' 
end of the mascRNA. The shorter mascRNA adopts a 
tRNA clover-leaf structure and is subject to RNaseZ pro- 
cessing and the addition of a CCA to its 3' end before 
being exported to the cytoplasm. The generation of puta- 
tively functional sRNAs by post-transcriptional proces- 
sing of IncRNAs, such mascRNA from MALAT1, may 
be reflective of a central, unexplored theme in ncRNA 
biology [144]. Recent evidence suggests regulated post- 
transcriptional processing events, such as splicing and 
post-cleavage capping, may increase the functional 
diversity of the metazoan transcriptome [145-149]. In 
contrast, the long MALAT1 transcript is not polyade- 
nylated, but has a poly(A) tail-like sequence that is 
genome encoded and is putatively present to protect 
the MALAT1 transcript from degradation. Moreover, 
MALAT1 localizes to nuclear speckles in a transcrip- 
tion dependent manner [136,150]. 

Following the correlation between high levels of 
MALA Tl expression and metastasis, a number of studies 
have implicated MALA Tl in the regulation of cell mobi- 
lity. For example, RNA interference (RNAi)-mediated 
silencing of MALA Tl impaired the in vitro migration of 
lung adenocarcinoma cells through concomitant regula- 
tion of motility regulated genes via transcriptional and/or 
post-transcriptional means [151]. Similarly, short hairpin 
RNA inhibition of MALA Tl reduced cell proliferation 
and invasive potential of a cervical cancer cell line [152]. 
Collectively, these studies suggest that MALAT1 regu- 
lates the invasive potential of metastatic tumor cells. 
Other examples of cancer-associated IncRNAs subject to 
post-transcriptional processing include the miR-155 host 
gene BIC and the miR- 17-92 cluster in B-cell lymphoma 
and neuroblastoma, respectively [153,154]. 

Recent efforts have focused on elucidating the molecu- 
lar role of nuclear speckle localized MALAT1 transcripts, 
while the function for the mascRNA has yet to be deter- 
mined. Nuclear speckles are thought to be involved in 
the assembly, modification and/or storage of pre-mRNA 
processing machinery [136,155]. A recent publication 
revealed MALA Tl RNA strongly associates with serine- 
arginine rich splicing factor (SR) proteins which are 
involved in both constitutive and alternative splicing and 
the levels of MALAT1 regulated the cellular levels of 
phosphorylated SR proteins [66]. These findings imply 
that the IncRNA MALA Tl may serve a function in the 
regulation of alternate splicing by modulating the activity 
of SR proteins, although precisely how this may contri- 
bute to tumorigenesis remains unknown. 
LncRNA-mediated tumor suppression by P53 stimulation 
MEG3 - Maternally expressed gene 3 The maternally 
expressed gene 3 (MEG3) was the first IncRNA proposed 
to function as a tumor suppressor. The MEG3 gene is 



expressed in many normal human tissues, with the high- 
est expression in the brain and pituitary gland [156,157]. 
MEG3 expression was not detectable in various brain 
cancers, nor in a range of human cancer cell lines impli- 
cating a potential role of this IncRNA in suppression of 
cell growth. Moreover, ectopic expression of MEG3 
RNA was found to suppress the growth of several 
human cancer cell lines, further supporting the role of 
MEG3 as a tumor suppressor [157]. In clinically non- 
functioning pituitary tumors, it was demonstrated that 
hypermethylation of the MEG3 regulatory region was 
associated with the loss of MEG3 expression, providing 
evidence for a mechanism of MEG 3 inactivation [157]. 

MEG3 is a paternally imprinted, single copy gene com- 
prised of 10 exons [156]. To date, 12 MEG3 isoforms have 
been detected (MEG3; a-3k) due to alternative splicing 
[158]. Each isoform contains the common exons 1-3 and 8- 
10, but varies in the combination of exons 4-7 in the middle 
of the transcript [158]. Notably, the last intron of MEG3 
encodes the evolutionarily conserved miR-770, a miRNA 
with a number of putative mRNA targets [159]. This 
arrangement mirrors the presence of miR-675 in HI 9, 
except in that instance the miRNA was exon encoded 
[102]. The originally identified isoform of MEG3 is 
expressed as a 1.6 kb polyadenylated transcript that is loca- 
lized to the nucleus where it is associated with chromatin 
although some cytoplasmic MEG3 transcripts have been 
detected [156,160,161]. All 12 MEG3 isoforms demonstrate 
three distinct secondary folding motifs designated Ml, M2 
and M3 [158]. 

Functionally, MEG3 has been implicated as a top- 
level regulatory RNA due to its ability to stimulate 
both p53-dependent and p53-independent pathways 
[158,162]. Critically, the M£G3-mediated functional 
activation of p53 is dependent on the secondary struc- 
ture of the MEG3 RNA rather than on primary 
sequence conservation. In an elegant series of experi- 
ments, Zhang et al. demonstrated that replacing parti- 
cular regions of the MEG3 with unrelated sequence 
had no effect on the activation of p53 provided the ori- 
ginal secondary structure was preserved [158]. This 
observation strengthens the argument that the rapid 
evolution and relative lack of sequence conservation 
does not limit the potential functionality of these 
unique RNA species [163]. Conversely, the novel 
HncRNA-p21 has been described as a downstream 
repressor in the p53 transcriptional response, suggest- 
ing the complex p53 transcriptional network includes 
numerous regulatory IncRNAs [164]. 
Depletion of miRNAs by a IncRNA 'molecular decoy' or 
'miRNA sponge' 

Expression of miRNAs in cancer can be deregulated by 
a range of mechanisms, including copy number altera- 
tions and epigenetic silencing [165-168]. Two recent 
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examples have demonstrated that IncRNAs can act as 
natural miRNA sponges' to reduce miRNA levels [169]. 
HULC - Highly Upregulated in Liver Cancer The 
most highly upregulated transcript found in a microar- 
ray-based study of gene expression in hepatocellular car- 
cinoma was determined to be the ncRNA HULC, or 
Highly Upregulated in Liver Cancer [170]. Transcribed 
from chromosome 6p24.3, this IncRNA demonstrates 
the hallmarks of a typical mRNA molecule, including a 
single spliced GT-AG intron, canonical polyadenylation 
signals upstream of the poly(A) tail and nuclear export 
demonstrating strong localization to the cytoplasm. 
Although HULC was found to co-purify with ribosomes, 
no translation product for this IncRNA has been 
detected, supporting its classification as a non-coding 
transcript [170]. In addition to liver cancer, HULC was 
found to be highly upregulated in hepatic colorectal 
cancer metastasis and in hepatocellular carcinoma cell 
lines (HCC) producing hepatitis B virus (HBV) [171]. 

A recent paper began to elucidate the mechanism of 
upregulation of HULC in liver cancer cells, and to pro- 
vide a potential mechanism of HULC function [172]. 
HULC exists as part of an intricate auto-regulatory net- 
work, which when perturbed, resulted in increased 
HULC expression (Figure 5). The HULC RNA appeared 
to function as a 'molecular decoy' or 'miRNA sponge' 
sequestering miR-372, of which one function is the 
translational repression of PRKACB, a kinase targeting 
cAMP response element binding protein (CREB). Once 
activated, the CREB protein was able to promote HULC 
transcription by maintaining an open chromatin struc- 
ture at the HULC promoter resulting in increased 
HULC transcription [172]. 



c ^ 




Figure 5 Proposed mechanism of HULC upregulation in 
hepatocellular carcinoma. (1) The kinase PRKACB functions as an 
activator of CREB. (2) Phosphorylated (activated) CREB forms part of 
the RNA pol II transcriptional machinery to activate HULC 
expression. (3) Abundant HULC RNA acts as a molecular sponge to 
sequester and inactivate the repressive function miR-372. (4) 
PRKACB levels increase, as transcripts are normally translationally 
repressed by high miR-372 levels. 

V / 



Pseudogene pairs: PTEN and PTENP1 The discovery 
that HULC can act as a molecular decoy or 'miRNA 
sponge' mirrors a recent report describing an intricate 
relationship between the tumor suppressor phosphatase 
and tensin homolog (PTEN) and the matching IncRNA 
phosphatase and tensin homolog pseudogene 1 (PTENP1) 
[173]. The RNA transcripts of the P TEN IP TENP1 pair 
share similar 3' untranslated regions (3'UTRs) which both 
bind the same miRNAs. By binding miRNAs, PTENP1 
transcripts reduce the effects of translational repression 
on PTEN therefore allowing expression of this tumor 
suppressor. In cancer, specific mutations inactivate these 
miRNA binding sites in PTENP1, therefore reducing the 
translation of PTEN and promoting tumor growth [173]. 
This is especially pertinent as subtle changes in PTEN 
levels can influence cancer susceptibility [174]. 
LncRNA-mediated deregulation of the tumor suppressors 
ANRIL - Antisense Non-coding RNA in the INK4 Locus 
Located as part of the 42kb INK4b-ARE-INK4a locus on 
chromosome 9p21.3, the Antisense Non-coding RNA in 
the INK4 Locus (ANRIL) is transcribed by RNA pol II 
and processed into alternatively spliced isoforms, includ- 
ing an unspliced transcript of 34.8 kb termed pl5AS 
[175,176]. The INK4b-ARF-INKa locus has an important 
role in cell cycle control, cell senescence, stem cell 
renewal and apoptosis through P14ARF-MDM2-P53 and 
V16Ink4a/pl5Ink4b-Cdk4./6-pRb pathways [177-179]. A 
mouse model suggests that the well-characterized tumor 
suppressor genes encoded within this locus are regulated 
by Polycomb proteins [180]. 

Aberrant expression and single nucleotide polymorph- 
isms (SNPs) within ANRIL have been associated with sus- 
ceptibility to a range of human diseases, including cancer 
[181,182]. Moreover, the INK4b-ARF-INK4a locus is sub- 
ject to frequent deletion or hypermethylation in cancers, 
including leukemia, melanoma, lung and bladder cancers 
[181]. ANRIL has been associated with epigenetic silencing 
of the tumor suppressor gene pI5, although the molecular 
events leading to this silencing were unclear [175]. 

Evidence has suggested an intriguing mechanisms for 
ANRIL-mediated silencing of the INK4b-ARF-INK4a 
locus. Recently an interaction between chromatin modi- 
fying PRC complexes and ANRIL has been described, 
further elucidating the regulatory mechanism of this 
IncRNA. Like the IncRNA HOTAIR which binds both 
the polycomb repressor complex PRC2 and the LSD1 
complex, ANRIL binds and recruits two polycomb 
repressor complexes modifying complexes, PRC1 and 
PRC2 [183,184]. This resulted in ANRIL/PRC mediated 
silencing of the genes in the INK4b-ARF-INK4a locus. 
While the distinct regions of the HOTAIR IncRNA 
required for interactions with each protein complex 
were determined [128], future studies will be necessary 
to elucidate the structural requirements of IncRNAs 



Gibb et al. Molecular Cancer 201 1, 10:38 
http://www.molecular-cancer.eom/content/1 0/1 /38 



Page 9 of 1 7 



such as ANRIL with chromatin regulators such as PRO/ 
PRC2. 

The molecular details of ANRIL-medmted tumor sup- 
pression are becoming more clear. However, other 
mechanisms of IncRNA-mediated suppression of tumor 
suppressor genes have been reported. For example, a 
change in the expression ratio of bidirectional genes has 
been shown to mediate the expression of the tumor 
suppressor p21 [185]. Collectively, these observations 
suggest that IncRNA-mediated silencing of tumor sup- 
pressor genes may be a major mechanism driving 
tumorigenesis. 

Cancer type specific IncRNA expression 

Many of the described IncRNAs are expressed in a vari- 
ety of cancers, however a select few thus far have been 
associated with a single cancer type. HOT AIR, for exam- 
ple, has only been described in breast cancer, while 
three IncRNAs PCGEM1, DD3 and PCNCR1 have been 
associated solely with prostate cancer [68,186,187]. The 
most recently described of these, Prostate Cancer Non- 
Coding RNA 1 (PCNCR1) IncRNA, was identified in a 
'gene desert' on chromosome 8q24.2 and is associated 
with susceptibility to prostate cancer. PCNCR1 is 
expressed as an intronless, -13 kb transcript with a 
potential role in trans-activation of androgen receptor 
(AR), a key player in prostate cancer progression [187]. 
Likewise, PCGEM1 (Prostate Specific Gene 1) was found 
to have properties supporting tumorigenesis, as ectopic 
overexpression of PCGEM1 RNA resulted in increased 
cell growth and colony formation in cell lines [188]. The 
IncRNA Differential Display Code 3 (DD3) is also highly 
over expressed in prostate cancer, yet little is known 
about the role DD3 may play in prostate cancer progres- 
sion [189,190]. Finally, the liver associated IncRNA 
HULC is highly expressed in primary liver tumors, and 
in colorectal carcinomas that metastasized to the liver, 
but not in the primary colon tumors or in non-liver 
metastases [171]. 

RNA polymerase III transcription of IncRNA 

The IncRNAs described thus far are products of RNA 
pol II transcription, yet many ncRNAs are transcribed 
by RNA polymerase III (RNA pol III) [191]. Importantly, 
RNA pol III is frequently deregulated in cancer cells 
resulting in increased activity [192,193]. The molecular 
mechanisms driving increased RNA pol III activity in 
tumor cells include overexpression of RNA pol III tran- 
scription factors, escape from RNA pol III repressors 
and direct oncogene-mediated activation [193-195]. 
Aberrant RNA pol III function may have consequences 
to the expression of IncRNAs transcribed by this 
polymerase. 

For example, the IncRNA BC200 is a small cytoplas- 
mic IncRNA in the neurons of primate nervous systems 
and human cancers, but not in non-neuronal organs 



[20,50,51,196,197]. Unlike the majority of IncRNAs 
described thus far, BC200 is transcribed by RNA pol III 
and shares unique homology with human Alu elements 
[196,198]. Similarly, the IncRNA HULC also shares 
homology with mobile DNA, in this case with a long 
terminal repeat (LTR) retroelement [170]. The BC200 
RNA has been characterized as a negative regulator of 
eIF4A-dependent translation initiation [199]. 

Due to the fact that many whole transcriptome sequen- 
cing methods were developed to enrich for poly(A) puri- 
fied transcripts, RNA pol III transcripts may have been 
excluded from analysis. This suggests that other, yet uni- 
dentified RNA pol III IncRNAs over-expressed in cancer 
may be participating in tumorigenesis. 
Aberrant T-UCR expression in human carcinoma 
Transcribed ultraconserved Regions (T-UCRs) are evolu- 
tionary conserved sequences found in both intergenic 
and intragenic regions of the human genome [200,201]. 
These unique sequences are defined as 481 segments of 
DNA that are absolutely conserved between orthologous 
regions of the human, rat and mouse genomes [201,202]. 
The transcription products of T-UCRs are 200-779 nt in 
length and were originally classed into three categories, 
non-exonic, exonic and possibly exonic, according to 
their overlap with known protein-coding genes [202]. 
More recently, T-UCRs have been re-annotated into a 
more descriptive set of five categories: intergenic (38.7%), 
intronic (42.6%), exonic (4.2%), partly exonic (5%) or exo- 
nic containing (5.6%) (Figure 6) [59]. 

The high degree of conservation of T-UCRs, combined 
with their tissue-specific expression, suggests these 
ncRNAs may play a critical role in cellular metabolism 
and development [200] . The expression of many T-UCRs 
is significantly altered in cancer, notably in adult chronic 
lymphocytic leukemias, colorectal and hepatocellular car- 
cinomas and neuroblastomas [85,200]. Their aberrant 
transcription profiles can be used to differentiate types of 
human cancers and have been linked to patient outcome 
[85]. Some of the T-UCRs reside in genomic regions asso- 
ciated with specific types of cancer [200]. For example, in 
colon cancer, the T-UCR uc. 73A is one of the most highly 
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Figure 6 Genomic locations of the five classes of T-UCRs. The 

exons of coding genes are indicated by boxes, while the locations 

of the T-UCR elements are marked by a double-T bar. The five 
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upregulated T-UCRs, and this IncRNA has been found to 
show oncogenic properties by proliferation assays [200]. 
Similarly, T-UCR uc.338 is significantly upregulated in 
human hepatocarcinoma tumor and cell lines, and uc.338 
was found to be part of a larger transcriptional unit coined 
TUC338 which is involved in cell growth [203]. 

The mechanism by which T-UCRs are differentially 
expressed is unclear. A study profiling T-UCRs in neu- 
roblastomas did not find a consistent association 
between genomic alterations and T-UCR expression, 
suggesting aberrant T-UCR expression may be a conse- 
quence of epigenetic mechanisms. Like miRNAs and 
coding genes, T-UCR expression has been shown to be 
repressed by CpG island hypermethylation [85,204]. 
Similarly, miRNAs have been shown to bind to, and 
downregulate T-UCRs, suggesting a complex regulatory 
mechanism may exist between ncRNAs in human can- 
cers [85,200]. Collectively, these reports have suggested 
that aberrant T-UCR expression may have a yet unde- 
termined biological role in tumorigenesis. 

Utility of LncRNAs in Cancer Diagnostics and Therapies 

Like their smaller non-coding miRNA counterparts, 
IncRNAs represent a significant untapped resource in 
terms of developing diagnostics and therapies. Differential 
or high level expression of certain cancer type-specific 
IncRNAs can be exploited for the development of novel 
biomarkers as IncRNA expression or may potentially 
correlate with patient response to chemotherapy. Under- 
standing the mechanism(s) by which IncRNAs act will 
continue to provide novel approaches to regulating genes 
including the development of mimetics to compete with 
binding sites for miRNAs, chromatin remodelers, or DNA. 
It has been suggested that mediating transcriptional gene 
silencing (TGS) pathways, especially those of tumor 
suppressors or oncogenes, could be of high therapeutic 
benefit [205]. 

It has been widely reported that cancer-specific miR- 
NAs are detectable in the blood, sputum and urine of 
cancer patients [206-210]. Likewise, IncRNAs have 
demonstrated utility as fluid-based markers of specific 
cancers. For example, the prostate specific IncRNA DD3 
has been developed into highly specific, nucleic acid 
amplification-based marker of prostate cancer, which 
demonstrated higher specificity than serum prostate- 
specific antigen (PSA) [211,212], Similarly, the highly 
expressed hepatocarcinoma-associated IncRNA HULC is 
detectable in the blood of hepatocarcinoma patients by 
conventional PCR methods [170]. 

The use of IncRNAs as therapeutic agents is only 
beginning to be explored [213]. Although our under- 
standing of the molecular mechanisms of IncRNA func- 
tion is limited, some features of IncRNAs make them 
ideal candidates for therapeutic intervention. Many 



IncRNAs appear to have protein-binding or functional 
potential that is dependent on secondary structure, this 
may provide a means of intervention [214]. Preventing 
the interactions of HOTAIR with the PRC2 or LSD1 
complexes, for example, may limit the metastatic poten- 
tial of breast cancer cells [215]. 

The tumor expression of certain IncRNAs provides a 
source of regulatory regions that can be used to reduce 
the risk of affecting normal tissues during transgene- 
mediated treatment. For example, HI 9 is strongly 
expressed in embryonic cells and in a wide-range of 
human cancers [20,93,94]. As such, a plasmid-based sys- 
tem has been developed to exploit the tumor-specific 
expression of HI 9, primarily tested in treating bladder 
cancer. A plasmid construct, harboring a diptheria toxin 
gene driven by H19 specific regulatory sequences, is either 
administered via intratumoral injection as naked DNA, or 
complexed to the cationic polymer polyethylenimine (PEI) 
to form a polyplex vector [216]. PEI-complexed plasmid is 
thought to increase the efficiency of DNA uptake 
via clathrin-dependent and -independent (cholesterol- 
dependent) pathways [217]. Upon uptake, high levels of 
diptheria toxin are expressed in the tumor, resulting in a 
reduction in tumor size in human trials [218-221]. Collec- 
tively, these advances indicate the potential in developing 
IncRNA mediated diagnostics and therapies. 

Conclusions 

Differential expression of IncRNAs is becoming recog- 
nized as a hallmark feature in cancer, however the func- 
tional role for the vast majority of these unique genes is 
still in question. In this review, we highlight character- 
ized IncRNAs described to play a functional role in can- 
cer-associated processes, such as metastasis and loss of 
imprinting. Aberrant IncRNA expression participates in 
carcinogenesis by disrupting major biological processes, 
such as redirecting chromatin remodeling complexes or 
inactivating major tumor suppressor genes. We also 
describe the potential role of dysregulated T-UCRs in 
cancer, and place these unique RNAs in the IncRNA 
category. Finally, we note the potential utility of 
IncRNAs in cancer as diagnostic and prognostic mar- 
kers, as well as the potential of developing IncRNA 
mediated therapy. 
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